List of Flash News about leadership benchmark
| Time | Details |
|---|---|
| 04:02 | Claude Opus 4.8 hits 69.2% on SWE-Bench Pro. With this score the model takes the lead in agentic coding; it also adds honesty cues and remains available at prior pricing via EasyRouterIO. |